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Abstract 


Our  wcrk  deals  with  the  analysis  of  the  queueing  delays 
of  buffered  multistage  Banyan  networks  of  multiprocessors. 
We  provide  tight  upper  bounds  on  the  mean  delays  of  the  se- 
cond stage  and  beyond,  in  the  case  of  infinite  buffers.  Our 
results  are  validated  by  simulations  performed  on  a  network 
simi.;lator  constructed  by  us.  The  analytic  work  for  network 
stages  beyond  the  first,  provides  a  partial  answer  to  open 
problems  posed  by  previous  research. 
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Introduction 


With  the  realistic  appearance  of  low  cost  microproces- 
sors, the  multiprocessor  architectures  became  very  attracti- 
ve. The  principal  characteristics  of  such  systems  are  the 
massive  parallelism  and  the  ability  of  each  processor  to  sha- 
re a  single  main  memory  or  a  set  of  memory  modules.  This  sha- 
ring capacity  is  provided  through  an  interconnection  network 
between  the  processors  and  the  memory  modules.  Among  the  dif- 
ferent physical  forms  available  for  the  network  are  the  ti- 
me-shared bus,  the  crossbar  switch  and  the  multistage  packet 
switching  networks,  such  as  the  Banyan  and  Delta  networks 
(see  e.g.  [GL,  73]  and  [P,  81]).  The  bus  has  a  very  limited 
transfer  rate,  especially  in  the  case  of  machines  with  thou- 
sands of  processors.  The  full  crossbar  is  not  only  very  ex- 
pensive but  it  also  requires  a  tremendous  amount  of  intercon- 
nections which  clashes  with  engineering  limitations.  The  mul- 
tistage networks  are  intermediate  cases,  with  small  cost  and 
good  performance.  They  have  been  adopted  recently  by  the  in- 
dustry (e.g.  the  RP3   machine  of  IBM,  [G,  84]).  Since  the  net- 
work is  an  important  component  of  the  multiprocessor  machine. 
It  is  important  to  have  a  solid  understanding  of  its  perfor- 
mance. In  most  of  the  proposed  designs  the  network  supports 
dynamic  access  from  each  processor  to  each  memory  module, 
and  the  traffic  through  the  network  consists  of  short  items 
(requests  to  memory  and  replies).  The  requests  are  being  dy- 
namically generated  independently  at  each  processor.  The 
pattern  of  requests  is  essentially  random  and  varies  rapidly. 

In  this  paper  we  analyze  the  queueing  delays  of  buffered 
multistage  interconnection  networks.  Previous  analyses  have 
been  reported  in  [KS,  83]  and  [KSW,  84]. 

They  mostly  concentrate  on  the  analysis  of  the  delays  in  the 
first  stage  and  provide  the  delay  distribution  under  uniform 
and  nonuniform  traffic.  The  delays  at  subsequent  stages  are 
estimated  there  by  simulations  and  there  are  several  conje- 
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ctures  about  them.  The  main  obstacle  is  that  the  distribu- 
tion of  packet  arrivals  is  not  time-independent  from  the  se- 
cond stage  on,  even  if  such  independence  is  assumed  for  the 
first  stage. 

Our  work  provides  analytical  tight  upper  bounds  for  the 
delays  at  the  second  stage  and  beyond.  It  indicates  that  de- 
lays tend  to  increase  from  the  first  to  the  successive  sta- 
ges. It  also  indicates  that  (for  any  fixed  packet  generation 
rate)  after  the  second  stage  there  is  no  notable  difference 
between  the  delay  times,  thus  giving  a  partial  positive  an- 
swer to  the  conjecture  and  experimental  results  of  [ KS ,  83]. 
Our  results  are  backed  up  by  the  simulations  we  did  on  a 
self-driven  network  simulator. 


2.  Networks  and  modelling  assumptions 

The  networks  we  consider  are  built  of  switches  connected 
by  unidirectional  lines. (In  actual  parallel  computers  it  is 
necessary  to  send  replies  back  through  the  network.  There  are 
many  ways  to  change  definitions  to  allow  that).  A  k  input,  k 
output  switch  can  receive  packets  at  each  of  its  k  input  ports 
and  send  them  through  each  of  its  k  output  ports.  A  network 
(see  [KS,  83])  is  a  labelled  digraph  where  nodes  are  of  the 
following  three  types: 

(i)    source  nodes  (indegree  0,  outdegree  1) 

(li)   sink  nodes  (indegree  1,  outdegree  0) 

(iii)  switches  (oositive  indegree  and  outdegree). 

Each  edge  represents  one  or  more  lines  going  from  a  node  to 
its  successor.  A  banyan  network  (see  [GL,  73])  is  a  network 
with  a  unique  path  from  each  source  to  each  sink  node.  An 
n-stage  banyan  network  is  a  banyan  network  whose  nodes  can  be 
arranged  in  stages,  with  all  the  source  nodes  connected  to 
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switches  at  the  first  stage,  and  all  the  outputs  at  stage  i 
connected  to  inputs  at  stage  i+1.  An  n-stage  rectangular  banyan 
network  of  degree  k  is  an  n-stage  banyan  network  built  of 
kxk  switches. 

We  restrict  our  analysis  to  oblivious  routing  algorithms, 
i.e.  algorithms  in  which  the  path  of  a  packet  through  the  net- 
work is  fixed  at  the  source  node  issuing  it.  The  path  can  be 
encoded  as  the  sequence  of  labels  of  the  successive  switch 
outputs  of  the  path  (path  descriptor) . 

Our  modelling  assumptions  are  those  usually  used  in  the 
literature  ([KS,  83],[P,81]).  Packets  are  generated  at  each 
source  node  by  independent,  identically  distributed  random 
processes.  Each  processor  generates  with  probability  p  at  each 
cycle  a  packet,  and  sends  a  generated  packet  with  equal  pro- 
bability to  any  sink  node  (uniform  access).  The  network  is 
assumed  to  be  synchronous  (discrete-time)  so  that  packets  C£.n 

be  sent  only  at  times  t  ,  2t  ,  3t  , . . .  where  t   is  the  net- 

c     c     c  c         

work  cycle  time.  We  assume  that  t   is  also  the  cycle  time  of 
each  switch.  For  the  analysis,  without  loss  of  generality,  we 
will  1 
lemma 


will  take  t  =1.  The  above  assumptions  imply  the  following 


Lemma  1   [KS,  83] 

Let  packets  be  generated  at  the  source  nodes  of  a  banyan 
network  by  independent,  identically  distributed  random  pro- 
cesses, that  uniformly  distribute  the  packets  over  all  of  the 
sink  nodes.  Assume  that  the  routing  logic  at  each  switch  is 
fair ,  i.e.  that  conflicts  are  randomly  resolved.  Then 

(a)  The  patterns  of  packet  arrivals  at  the  inputs  of  the 
same  switch  are  independent. 

(b)  Packets  arriving  at  an  input  of  a  switch  are  uni- 
formly distributed  over  the  outputs  of  that  switch. 

(c)  For  uniform  networks,  for  each  stage,  the  pattern  of 
packet  arrivals  at  the  inputs  of  that  stage  have  the 
same  distribution. 
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The  uniform  access  assumption  allows  us  to  represent  any 

kxk  switch  as  a  system  of  k  queues  working  in  parallel,  with 

a  deterministic  server  each  (of  service  time  equal  to  1).  Any 

packet  entering  any  of  the  k  inputs  of  the  switch,  goes  with 

probability  _1  to  any  of  the  (output)  queues  of  the  switch. 
k 


Figure  1  :  A  kxk  switch  model  with  buffer  size  b>_0 


3.  The  isolated  kxk  switch  and  the  B/D/1  queue . 

Definition    A  B/D/1  queue  is  a  queue  whose  input  (arri- 
val) process  is  a  Bernoulli  process  B(q,N)  where  N  is  the 
number  of  trials  and  q  is  the  success  probability,  i.e.  whe- 
re the  Prob{x  packets  arrive  at  the  queue  at  the  next  unit 

N   X      N  —  x 
of  time}=(  )q  (1-q)    .  The  server  of  the  queue  is  determi- 
nistic (of  service  time  equal  to  1)  and  the  arrivals  are  con- 
sidered to  occur  just  before  the  end  of  the  corresponding  ti- 
me unit.  (Discrete  time  queue).  We  consider  the  length  of 
each  time  unit  tc  be  equal  to  1. 
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It  is  clear  that  any  given  output  queue'of  the  isolated 

p 
kxk  switch  can  be  modelled  by  a  B{— ,k)  queue,  since  with  pro- 

bability  p/k  a  packet  both  appears  in  a  given  input  and  mo- 
ves to  the  queue  under  discussion.  In  fact,  if  we  consider 
an  isolated  kxk  switch  (or  any  switch  of  the  first  stage  of 
the  network)  and  if  v . =number  of  arrivals  at  (output)  queue  i 
of  the  switch,  for  i=l,2,...,k,  then  the  total  number  of  ar- 
rivals in  the  inputs  of  the  switch  during  a  c^ycle,  is 

v  =  v,  +.  .  .  +v,        and 
1       k  ,         ,  _ 

Prob{v  =  n}=  ( "^  )  p"  ( 1-p  ) '^  "   if  0<k<n 

n       "^  —  — 

=  0    else 
Since  any  assignment  of  n  arriving  concurrently  messages  to 
the  output  queues  has  probability  (— )   (by  the  assumption  of 

K 

random  resolution  of  conflicts),  we  conclude  that  the  joint 
arrival  distribution  at  the  k  queues  is 

Prob{v^  =  a^,V2  =  a2 V|^  =  aj^/a^+,  .  .+a|^  =  n}  = 


a^la^l.-.a^l 


k 


(EQ  1 


It  is  easy  to  prove  that 

Lemma  2  (Due  to  M.  Snir,  proof  in  the  full  paper) 

The  marginal  densities  Prob{v.=a.}  of  ( EQ  1)  are  the  sa- 

p        '  1   i' 

me  with  the  Bernoulli  B(— ,k),  assumed  in  the  analysis  of  the 

k  -^ 

B/D/1  queue. 

Definition   Let  q   be  the  number  of  packets  in  the  B/D/1 

queue  with  input  B(p/k,k)  at  the  end  of  cycle  n. 

Definition   Let  v^  be  the  number  of  piickets  joining  the 

p 
B/D/1  queue  of  input  B(-,k)  at  cycle  n. 

Definition   Let  A(k)=0  if  k<0  and  A{k)=l  else. 
It  is  clear  that 


q    =q  -A(q  )+v   . 
^n+1  ^n    ^n    n+1 


(EQ  2) 


Assuming  a  steady  state  distribution  q=lim  q   (which  always 

n 
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exists  if  p<l)  we  have  (by  taking  means) 
E(q)=E(q)-E(A(q) ) +E ( v ) 


implying 


E{A(q) )=E( v) 


(EQ  3 


But  V  has  the  same  distribution  with  v   (since  the  ar- 

n 


rival  process  is  net  time-dependent)  i.e. 


E( v)=E( V  )=kf  =P 
n    k 


However 


E(A{q))=i  A(k)Prob(q=k)=Prob(q>0 
k  =  0 

Comparing  EQ  3 ,  EQ  4 ,  EQ  5  we  have 


(EQ  4) 
(EQ  5) 


Lemma  3  The  utilization  of  the  B/D/1  queue  of  input 
B(p/k,k)  IS  equal  to  p.  L_J 

We  can  also  easily  prove  that 

Lemma  4  The  steady  state  mean  queue  length  of  the  B/D/1 
queue  (i.e.  of  any  output  queue  of  the  first  stage  of  th.e 
network)  is 

(EQ  6) 


^'^'•P^  ^T^pf 


(The  above  equation  includes  the  packet  in  service). 

Proof  sketch 

EQ  6  wa.s  derived  in  [KS,  83].  The  crucial  steps  to  get 

It  is  (1)  the  method  of  mome^nts  of  [KLEI,  75]  which  proceeds 

by  squaring  EC  2  and  then  taking  means  and  limits,  and  (2) 

the  fact  that  v   ^  is  independent  of  v. ,  i<n+l,  (and  thus  cf 
n  +  1         '^  i' 

q  )  ,  only  because  of  the  assumption  about  packet  generation 
in  the  processors. 


D 

It  is  an  easy  corollary  of  the  above  that  the  mean  delay 


in  the  first  stage  is 


W 


queue   2(l-p) 


,  in  number  of  cycles 


4.  The  queuing  delays  of  the  second  stage. 


By  symmetry,  it  is  enough  to  analyze  the  first  output 
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queue,  Q  ,  of  a  particular  switch  S  of  the  second  stage. 

Let  B-,  be  the  number  of  customers  in  Q   at  cycle  n. 

Let  X.    be  the  (possible)  arrival  of  a  packet  from  in- 
put i  to  Q    during  cycle  n+1. 

Let   q^^,...,q^,  be  the  number  of  customers  in  the  queues 


'11 

■St  sta 

during  cycle  n.  (See  Figure  2) 


(of  the  first  stage)  which  feed  the  sv/itch  S  containing  Q 


a, 


Sw.-tcV,  S 


Figure  2 :   The  second  stage. 


It  is  clear  that 

„n+l   n  .,„n,  ,  n+1       n+1, 
6    =B, -A( B  )+( X,   +. . .+x,    ) 
1111         k 
where  for  each  i 

x^'^~  =  L{q^  .  )       with  probability  r- 
1      ^li         ^  ^    k 


(EQ  7) 


=  0 
By  Lemma  1 ,  the  x 


else 
n  +  1 
1   . 


other,  also  the  q 


"+1       J     J   ^   ^ 
,x     are  idependent  of  each 


]1 


,q-|.  are  independent  of  each  other 


However  each   x.    depends  on  each  of  q,  .  and  thus  the 

B-i  and  x.    are  not  independent  random  variables. 

1       1  '^ 


Definition 


n  +  1 


Let  y  =  the  size  of  B,  given  that  x.   =1 

Let  (p"(a)=Prob{y"  =  a} 

1  1 
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We  need  a  rather  technical  lemma; 
Lemma  5   For  each  i,n  we  have: 


V, 


and 


„,„n   n  +  1 ,  _   ,  r  n+1  ,■,  _,  n, 
E(B-,*x.   )=Prob{x.   =l}'E{y.) 
1   1  1  1 


E(A(6")-x"+^)=Prob{x"^^  =  l}-(l-cp"(0)  ) 
1    1  1  1 


Proof  sketch 


_,„n   n  +  lx   „    „  ^      ^  r  J^  J   n  +  l„i 

E  (  B  T  X  .   )  =  la*  B'Prob  {  6.  =  a  and  x  .   =  B  1 

-'-   1       Q  ■'•         1 

Q.   B 

=  I    a*Prob{B,=ci  and  x    =1} 
a>0  1 

r^   i^rn  +  l,,^   vr„n   ,n  +  l, 
=  Z  a'Probtx.   =1 IProbt B, =a/x  .   =1 

a>U 


QED 


Similarly  for  the  second  equation 


n 


n 


Let  us  also  note  that  x.=lim  x.=A-    with  picb  1/k 

1       1   q,  .       ^ 
n-"      ^li 

=0     else , 

hence  E ( x . ) =Prob { x . >0 } =-   (by  Lemma  3)  V  i  and  thus 
1         1     k     -^ 

E ( x, + . . . +x  ) =p ,  implying  that  lim  B,  exists  if  p<l. 
i       k  _  _^   i 

By  taking,  then,  limits  and  expectations  in  (EC  7)  we 
conclude 


E( A(B,  )  )=E(x,  +  .  .  .+x,  )=p 
1       1       k 


I.e. 


Lemma  6 


The  utilization  of  the  queue  Q   of  S  is  p,  at  the  steady 


state , 


n 


The  above  also  imply  that  y.=lim  y.  exists  if  p<l. 

1  n-"   1 

By  squaring  ( EQ  7)  (method  of  moments)  and  using  Lemma  5  and 
then  by  taking  expectations  and  limits,  we  get 

k  k 

2  E(BT)-2g-  Z  E(y.  )=p+E(  (x  +.  .  .+x  )^)-2g    Z  (l-<p  (0)) 
1    k  ^^^    1         1       k      '^   i  =  l     1 

(EQ  7) 
(Note  that  one  must  use  A^(b")=A(b")  and  B^'A(b")=b"   also 
Lemma  1,  to  get  (EQ  7)). 
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Since  x  .  =x .  v  i  and  x.,x.  are  independent  for  i?^j  from 
11  1   J 

Lemma  1 ,  we  get 


E({x^+...+X|^)^)=k  E(x^)+2^^^^^2^  E^(x^) 


=p+k(k-l)  -^  (EQ  8) 

By  symmetry,  E(y. )  is  the  same  V  i  and  this  also  holds  for 
y  .  (0)  ,  hence  by  ( EQ  7) : 


Lemma  7 


2  E(B^)-2pE(y^)=  -^  P^  +  2p  <p^(0 


(EQ  9) 


D 


By  ncting  that  the  r.v.  y,  is  the  distribution  of  the 
size  of  queue  Q   as  seen  by  an  arriving  pa;cket  from  input  1, 
we  can  get  a  worst  c£:se  bound  fcr  this  distribution  (leading 
to  an  upper  bound  for  E(B,))  by  using  the  "bulk  arrival"  dis- 
tribution 

Y  =  the  size  of  S>      given  that 


X  =1  or  X  =1  or  ... 
and  $  (a)=Prob{Y  =a: 


or  X,  =1 
k 


Clearly  (as  we  will  show  immediately' 


E(Y^)  >  E(y^) 


leading  to 

2  E{6^)<  ^^  P^  +  2p  <p^(0)+2pE(Y^)       ( EQ  10) 

We  are  going  to  use  an  Operational  Approach  ([BE,  78]) 
argument  to  get  an  upper  bound  en  E(Y^).  Let  us  consider  the 
case  where  t^  is  an  instant  at  which  arrivals  start  to  happen 
at  Q   (no  such  arrivals  happened  at  t  -1).  Arrivals  will  con- 
tinue to  happen  for,  say,  X    cycles  and  then  no  arrivals  will 
happen  at  Q   for,  say,  p  cycles.  If  we    assume  that  the  mean 
number  of  arrivals  per  cycle  is  1  for  the  interval  of  X  cycles, 
then  we  get  a  worst  case  bound  for  Y, .  (See  Figure  3).  Let  q„ 
be  the  queue  size  of  Q   at  t.. 
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x^::^^-'^'^^ 


>Y 


'^  cycles 


Ney<-i. 


No  Q.t.tjL^4Cxl«, 


Figure  3:  Operational  Approach  argument. 
Clearly  E{Y^)=qQ-  j^  in  the  interval  ( t  t  +X+u  ] 
while 


E(3^)= 


^qQ+uqQ-(l+2+. . .+u: 


in  the  same  interval . 


=qn- 


U(U+1) 


0   2{A+u) 


So    E{Y.  )=E(3J  +  {t^^)  •  l-?^)-  (^-) 
1      1    X+u    A+1     2 


(EQ  11) 


But  ^  <1,  ^    <1 
A+u  -    A+1  — 


E(Y^)  <  E(3^)+E(i^ 


hence 


I.e. 


E(Yj^)  <  E{0  )  + 


E(u)-1 


(EQ  12) 


Since  the  x^,...,x^    are  mutually  inc'eceneent  and  since 
Prob  {x,=C}  =l-p/k  we  get 

Probino  arrivals  at  a  cycle}=(l-  ^)^ 
and,  hence, 

E(u)=(l-  ^)-^  (EQ  13) 

We  conclude 


Lemma  8 


E(Y^)  <  E(3^)+. 


(1-  #)-''-: 


By  using  EQ  10  then,  we  get 


E(B  )  < 
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^^p  +p(l-  ^)       -p+2p  9^(0) 


>i":- 


2(i-p: 


B 


ut  (P^(0)  <_  1-utilization  fo  queue  Q    since 


E(Y^)  >  E{B^) 


So 


E(B^)< 


k-1  2   , ,   Pn -k    ^  , T   X 
— j— p  +p(l-  ^)       -p+2p(l-p) 


2{l-p) 


I.e. 


Lemma  9 


p^d-  h         p[(l-  f)'*"-!] 
E(B,)  <  P^  ^(TT^f  +     2(l-p) 


(EQ  14 


i.e.  the  mean  queue  size  at  the  second  stage  is  upper  bounded 

by  the  sum  of  the  mean  queue  size  of  the  first  stage  and  the 

factor 

p[{l-  ^)"  -1] 


2(l-p) 


n 


The  result  of  Lemma  9  has  been  validated  by  our  simu- 
lations. For  small  p  it  gives  an  excellent  agreement. 

5.  The  queueinq  delays  at  subsequent  stages 


Let  us  now  inductively  assume  that,  for  each  output  queue 

Q  of  stage  m,  we  have  for  the  steady  state  queue  size  q   that 

m 

E{A(q  ))=p  and  that  ( EQ  14)  holds  for  that  queue.  This  holds 
for  stage  2,  as  we  showed  in  Chapter  4.  This  implies  that 

E(x.)=  ^  for  the  corresponding  x.  feeding  the  next  stage.  Sin- 
ce Lemma  1  holds  for  all  stages  and  since  the  rest  of  the 
analysis  of  Chapter  4  (especially  EQ  10,11,12,13)  do  not  as- 
sume anything  else  about  the  x  ,...,x   except  that  of  Lemma  1, 
we  conclude  that  we  can  validate  the  rest  of  the  equations 
completely  and  thus  prove  for  any  queue  Q'  of  stage  m+1  that 

its  steady  state  q   ^  satisfies  E(A(q   ,))=p  and  also  ( EQ  14). 

m+1  m+1 

By  induction  then,  we  get: 

Lemma  10  (For  traffic  from  processors  to  memories) 

(a)  The  mean  size,E(3),of  each  output  queue  of  any  switch  of 
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any  stage  i  >^2  ,  satisfies  "~^"- . 

p^d-  h      p[(i-  f)"*"-!]        '  ■   •■    '      - 

E(B)<  p+  -.^^ ^  +  ,./  ^ 

—   '^         2  ( 1-p)        2  (1-p) 

(b)  The  mean  queueing  delay  of  a  packet  in  any  stage  i>2  sa- 
tisfies 


W 


-  !"'^'  ,  '^-^'"'-^ 


queue  —  2 (1-p 


2{1-d: 


Remarks 


Our  simulations  validate  Lemma  10.  Indeed  mean  queue  si- 
zes increase  from  the  first  to  the  second  stage  and  change 
insignificantly   thereafter.  See  tables  of  Appendix.  The  ex- 
tention  of  our  results  to  the  case  of  queues  of  finite  size 
is  currently  under  investigation.  Although  researches  have 
argued  that  finite  buffers  behave  as  in  the  case  of  infinite 
buffers  when  their  size  exceeds  the  mean  queue  length,  (see 
[KS,  83]),  it  has  been  remarked  (in  the  design  of  RP3 )  that 
this  is  not  the  case  for  large  p.  We  pose  this  as  an  open 
problem . 
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APPENDIX 


V-- 


Our  self-driven  simulator  is  written  in  Pascal  and  runs 
on  a  VAX/750.  We  use  a  three-dimensional  array  to  represent 
the  network.  The  first  dimension  is  the  stage,  the  second  is 
the  switch  in  the  stage  and  the  third  is  the  input  of  the 
switch.  For  the  interconnection  between  stages  we  use  the 
formula  derived  in  [P,  81].  Our  random  number  generator  is 
of  the  modulo  type. 

TABLES 

Table      1 

Number  of  processors  =  512  (9  stages) 

2x2  switches 
probability  p=0.2 

steps  of  the  simulation  =  20,000 


stage 

^ 

mes-n 

queue  length, E(B,) 

conditional  mean  queue  length, E(y) 

1 

0.21370 

0.281040 

2 

0.21185 

0.285426 

3 

0.21445 

0.305997 

4 

0.21035 

0.304020 

5 

0.21215 

0.284211 

6 

0.21850 

0.303797 

7 

0.22010 

0.317907 

8 

0.21300 

0.295908 

9 

0.21390 

0.305008 
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Table 


•X:^. 


Number  of  processors  =  512  (9  stages) 

2x2  switches 

probability  p=0.4 

steps  of  the  simulation  =  20,000 


stage  H 

mezn 

queue  length,  E(B,) 

conditional  mean  queue  length, E(y  ) 

1 

0.46525 

0.549005 

2 

0.47075 

0.596522 

3 

0.47620 

0.615155 

4 

0.48165 

0.638999 

5 

0.47375 

0.615714 

6 

0.47385 

0.620073 

7 

0.49700 

0.646475 

8 

0.48940 

0.652619 

9 

0.4  8  7^15:> 

0.627983 

Table   3 

Number  of  processors  =  512  (9  stages 
2x2  switches 

probability  p=0.6 

steps  of  the  simulation  =  20.000 


stage  || 

mean 

queue  length, E(B,) 

conditional  mean  queue  length, E(y  ) 

1 

0.82845 

0.959728 

2 

0.87660 

1.074090 

3 

0.83835 

1.031580 

4 

0.84580 

1.056960 

5 

0.86805 

1.081350 

6 

0.86185 

1.084420 

7 

0.82795 

1.015700 

8 

0.86145 

1.053790 

9 

0.85410 

1.046780 
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